You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Generate a training set (list of arbitrary sentences that user can read out loud) to re-train your own deepspeech model on additional datasets, and to tailor it to your voice.
Allow user to easily retrain with a nice interface.
Set up hardcoded voice commands for basic tasks, such as: add a task, due date, priority, project, show tasks of project .. on screen. Give summary of tasks of project x.
Create a training set to retrain the deepspeech model on these hardcoded commands.
Allow user to easily retrain with a nice interface.
Create a neural network/AI that performs interpretation of natural speech commands (and asks for clarification on low confidence inputs). Example: "Add a project that is due next tuesday at 1500 for the swimming project with content: email all coaches and priority medium."
Ask user if they want to help improve the taskwarrior natural language processing model, by providing spoken task descriptions/taskwarrior commands along with the actual typed/digitial/exact commands. (Either allow:
Users to provide a audio fragment and a typed taskwarrior command.
Users to provide a audio fragment and a automatically registered taskwarrior command. (Allow users to specify a single easily understandable signal that can be used to tell the agent that the command was interpreted wrongly. E.g. saying:"WRONG". That way the label can be can be set to: Not this taskwarrior command.
Do not share audio, and only share natural language processing interpretation of voice command.
As soon as time permits, apply differential privacy to ensure the natural language processing does not train on actual taskwarrior commands but on encrypted taskwarrior commands. This to preserve user privacy whilst keeping identical performance (at the cost of (significant) loss of model understandability).
Support modularity of speech recognition models.
Support modularity of natural language interpretation models.
Allow a single integrated model that goes directly from audio to taskwarrior commands.
Set up the deepspeech over tor such that you can connect to it from your phone from anywhere in the world regardles of your local networking situation.
Included a slimmed down deepspeech model to run with AI accelerator chips on your phone to speed up the response time of deepspeech, if the speech/inference delay is too large to make it feel natural.
Set up tor connection on android that transmits speech signals to your own deepspeech server.
Set up a local server running the deepspeech model (docs) of Mozilla common voice for the language preferred by the taskwarrior user.
Generate a training set (list of arbitrary sentences that user can read out loud) to re-train your own deepspeech model on additional datasets, and to tailor it to your voice.
Set up hardcoded voice commands for basic tasks, such as: add a task, due date, priority, project, show tasks of project .. on screen. Give summary of tasks of project
x
.Create a neural network/AI that performs interpretation of natural speech commands (and asks for clarification on low confidence inputs). Example: "
Add a project that is due next tuesday at 1500 for the swimming project with content: email all coaches and priority medium.
"Ask user if they want to help improve the taskwarrior natural language processing model, by providing spoken task descriptions/taskwarrior commands along with the actual typed/digitial/exact commands. (Either allow:
As soon as time permits, apply differential privacy to ensure the natural language processing does not train on actual taskwarrior commands but on encrypted taskwarrior commands. This to preserve user privacy whilst keeping identical performance (at the cost of (significant) loss of model understandability).
Support modularity of speech recognition models.
Support modularity of natural language interpretation models.
Allow a single integrated model that goes directly from audio to taskwarrior commands.
Set up the deepspeech over tor such that you can connect to it from your phone from anywhere in the world regardles of your local networking situation.
Included a slimmed down deepspeech model to run with AI accelerator chips on your phone to speed up the response time of deepspeech, if the speech/inference delay is too large to make it feel natural.
Set up tor connection on android that transmits speech signals to your own deepspeech server.
Allow the user to transmit over clearnet iso tor to speed up response times if desired: https://metrics.torproject.org/onionperf-latencies.html
The text was updated successfully, but these errors were encountered: