Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Speech Recognition #8

Open
7 tasks
laurentlb opened this issue Oct 27, 2024 · 0 comments
Open
7 tasks

Speech Recognition #8

laurentlb opened this issue Oct 27, 2024 · 0 comments

Comments

@laurentlb
Copy link
Owner

laurentlb commented Oct 27, 2024

I've built a prototype using the speech recognition browser API. It works, but there are some things to improve:

  • Detect when a browser doesn't support the API
  • Manual mode: click on an icon each time you want to speak
  • Stop the speech recognition when the user clicks on something else (e.g. next, restart the story...)
  • Animate the microphone icon when it's actively listening
  • Disable speech recognition when audio is playing (otherwise the microphone may capture the TTS)
  • Make possible to split a sentence into multiple chunks (combine the outputs, and see if the user said the full sentence). This would help with handling longer sentences.
  • Collect feedback. If anyone sees this message, please let me know how the feature works for you.

Note that the browser API is experimental. Not all languages/browsers are supported.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant