-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speech Note 4.6.0 Beta 2 #145
Comments
.mp3 not supported which seems ridiculous for a program that's main feature is to export .mp3. How do we submit examples of what we are talking about? _We don’t support that file type. Try again with GIF, JPEG, JPG, MOV, MP4, PNG, SVG, WEBM, CPUPROFILE, CSV, DMP, DOCX, FODG, FODP, FODS, FODT, GZ, JSON, JSONC, LOG, MD, ODF, ODG, ODP, ODS, ODT, PATCH, PDF, PPTX, TGZ, TXT, XLS, XLSX or ZIP._ |
Sorry for the very late reply. MP3 format is supported for both import and export. If that doesn't work, could you create a separate "issue" for this problem? Thanks. |
Release 4.6.0 is out, so closing. |
I was talking about the github site, not the program
…On Sat, Aug 3, 2024 at 7:14 AM mkiol ***@***.***> wrote:
@gbodley <https://github.com/gbodley>
.mp3 not supported
Sorry for the very late reply. MP3 format is supported for both import and
export. If that doesn't work, could you create a separate "issue" for this
problem? Thanks.
—
Reply to this email directly, view it on GitHub
<#145 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALBEH6N435JFSG4JEMQYDJDZPTJS7AVCNFSM6AAAAABJXNITHSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENRWG4YDSMRYGQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Ha ha, now that makes sense :) I was really worried that a key feature was not working. |
No. I wanted to post an example of the speech output of the program, but
mp3 is not allowed to be submitted by github for some unknown reason.
…On Sat, Aug 3, 2024 at 9:40 AM mkiol ***@***.***> wrote:
I was talking about the github site, not the program
Ha ha, now that makes sense :) I was really worried that a key feature was
not working.
—
Reply to this email directly, view it on GitHub
<#145 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/ALBEH6PTH2GKNPG7XZWOUFTZPT2XTAVCNFSM6AAAAABJXNITHSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDENRWHAZTMNRWG4>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
If you want to play and test the upcoming release, Speech Note 4.6.0 Beta 2 is available in "flathub-beta" repository. To enable "flathub-beta" follow these instructions.
Make sure to update the add-on to version 1.2.0 if you are using it.
Release Highlights:
All changes (compared to version 4.5.0):
Models that provide multiple sub-models (for example, TTS models
that provide different voices) are shown in groups. This makes it
easier to find models in the model browser.
'WhisperCpp' to better reflect the engine behind them.
To automatically detect the language during STT, select one of
the models that is in the 'Auto detected' category in
the language list.
The configuration of each engine has been separated in the settings.
You can separately set the parameters for 'WhisperCpp' and
'FasterWhisper'. The new configuration parameters that have been
added to the settings are: 'Number of simultaneous threads',
'Beam search width', 'Audio context size', 'Use Flash Attention'.
Optimization for short sentences has been added to 'WhisperCpp'.
With it, the speed of STT has doubled!
With OpenVINO decoding on CPU is much quicker. If you are not using
GPU acceleration, it is recommended to enable OpenVINO in
'WhisperCpp' engine settings.
Currently, OpenVINO is enabled only for CPU acceleration.
New settings option allows inserting processing related
information to the text after decoding, such as processing time and
audio length. This can be useful for comparing the
performance of different models, engines and their parameters.
Control tags allow you to dynamically change the speed of
synthesized text or add silence between sentences.
To use control tags, insert '{speed: 0.5}' or '{silence: 1s}'
into the text. For convenience, you can also insert
predefined control tags using text context menu 'Insert control tag'.
The 'Translate', 'Switch languages' and 'Add' buttons have been
placed between text areas which is more convenient.
Latvian to English, Danish to English
The text was updated successfully, but these errors were encountered: