This repo has two programs.
-
record is used to record text and audio commands into parquet files.
-
finetune is used to finetune a Whisper model with the recorded data.
See each folder how you can use these in conjuction to finetune the Whisper speec-to-text model for your robotics usecase.