Releases · daswer123/xtts-api-server

02 Jun 02:14

daswer123

0.9.0

792d0fb

0.9.0 Latest

Latest

0.9.0

Fix scapy problem
Switch from original tts to coqui-tts fork for version 0.24.1

Assets 2

02 Jan 16:22

daswer123

0.8.0

e1a458e

0.8.6

Improved work with cache
Fixed a bug related to the same response if requests were made at the same time

Thanks for the changes @chanis2 more info here #62

0.8.5

Fixing a potential security issue, thanks #61

0.8.4

Update requerments for tranformers lib

0.8.3

Remake Docker, now it works
UPD 19.01.24 - Thanks @mickdekkers for improving the docker

0.8.2

Increased the maximum chunk size for streaming
Correct README

0.8.1

Added the ability to customize the chunk size for streaming

0.8.0

Added new endpoints that allow you to: change the folder where the models are stored, change the model without restarting the server and endpoint to change the generation parameters.
In version 0.7.6 an endpoint for streaming was added, I advise you to pay attention to it #37

Contributors

mickdekkers and chanis2

Assets 2

21 Dec 00:37

daswer123

0.7.0

fc033f1

0.7.6

A new endpoint has been added that allows you to get streaming audio.
Streaming integration in SillyTavern SillyTavern/SillyTavern#1623.

Thanks to @Cohee1207 for all the changes in this update

0.7.5

Simplified loading custom models, now you need to create a models folder in the root and put your model in this folder, in the models folder there should be 3 files model.pth, vocab.json, config.json

Specify with a flag the name of the folder with your model, for example -v warcraft3

Removed an unnecessary warning

0.7.4

Connect Pypi and github https://pypi.org/project/xtts-api-server/

0.7.3

Added a --listen flag, this makes it easier to understand and use outside the local pc
Now you don't have to use -t , the script will try to automatically detect your local ip address and use it for the preview

0.7.2

Fixed the error that occurred if you specify the path to the speaker file

0.7.1

Added check for old format of model records, e.g. 2.0.2, they will be automatically brought to the correct form, e.g. from 2.0.2 to v2.0.2

0.7.0

You can load a custom model, for this you just need to create a folder in the models folder and put there 3 files vocab.json config.json model.pth after that you need to specify the name of this folder using the -v "Model Name" flag.
You can specify the path to a text file as an input, to do this, pass the path to the file with the .txt ending to the text field
Now you can cache the results so that you don't have to wait for a new generation when you make another request, use the new --use-cache flag to do this

Contributors

Cohee1207

Assets 2

04 Dec 00:41

daswer123

0.6.0

51e1a86

0.6.8

Added support for multiple sample speakers for input. thanks @lendot for the update

0.6.7

Added a check for the correct source
Fixed a bug with deepspeed

0.6.6

Fixed a bug with --streaming-mode-improve
Added support for deepspeed, can be enabled via flag --deepspeed will automatically download necessary libraries

0.6.5

Added the ability to synchronously play audio in streaming mode via new flag --stream-play-sync

0.6.4

Fixed error related to multiprocessing, thanks @lendot

0.6.3

Updated the RealtimeTTS library to 3.3.32
Fixed an issue with dependencies due to tts update

0.6.1

Updated the RealtimeTTS library to 3.3.32
Added info on webui to fine tune xtts

0.6.0

Added the ability to select the device to work, -d flag

Contributors

lendot

Assets 2

30 Nov 22:54

daswer123

0.5.7

1ef056e

0.5.9

Update TTS version to >=0.21.2

0.5.8

Fixed low generation quality on -ms local

0.5.7

Thanks @sharockys for helping me set up autodeploy on docker
After 10 comits, I think I got it set up on pypi.

Contributors

wang-haoxian

Assets 2

29 Nov 03:25

daswer123

0.5

1f3632f

0.5.6

0.5.0

Streaming mode has been added more details here https://github.com/daswer123/xtts-api-server#about-streaming-mode #10
Docker updated

0.5.1

Changed folder for the model that they would be in synergy with RealtimeTTS
Added information at startup about streaming mode

0.5.2

Fix bug when models download twice

0.5.3

Updated the RealtimeTTS library to version 0.3

0.5.4

Updated the RealtimeTTS library to version 0.31
Now in streaming mode you can interrupt the current stream and start a new one

0.5.5

Reduced the wait time after you have interrupted a response to 0.1 seconds
Time is still needed to avoid stuttering.

0.5.6

Added a new flag -streaming-mode-improve which triggers an improved version of streaming, more details in REAME
Removed the timer when we interrupt the stream and start a new one, thanks to the update 0.32.0 now it does it by itself and no more errors thanks @KoljaB for the quick fix in RealtimeTTS
Updated the RealtimeTTS library to version 0.3.2
Added a stream2sentence check

Contributors

KoljaB

Assets 2

27 Nov 08:21

daswer123

0.4.0

273bf43

0.4.5

Added changlog information

0.4.4

Slightly improved the operation of --lowvram
Added possibility to select model version via -v flag

0.4.3

Added the --lowvram flag which allows you to keep the model in RAM and load into VRAM only during the conversion

0.4.2

Changed the default model-source option to apiManual
Added apiManual option, which works like normal api but with model 2.0.2.

0.4.1

Сhanged the name of the options in the model_source flag to make it clearer. Now it local and api

0.4.0

Docker support has been added
Added model loading method,
Added TTS version check
Changed GET request from speakers_default to speakers_list for clarity
Updated note for creating samples for cloning
Fixed bug with code 307

For contributions and code samples, I'd like to thank @sharockys and @erew123

Contributors

wang-haoxian and erew123

Assets 2

23 Nov 04:47

daswer123

0.3.0

8dbda5b

0.3.2

0.3.0
Fix for Japanese
Fixed some silly comments in the code
Now the file is returned instead of the stream, it seems to have improved the playback speed in SillyTavern.

0.3.1
Fixed voice preview display when using google colab

0.3.2
Add Hindi support

Assets 2

21 Nov 10:39

daswer123

0.2.5

c088b20

0.2.5

Fix model loading

Assets 2

21 Nov 08:56

daswer123

0.2

9330c8b

0.2

Added a GET to get the speaker for SillyTavern
Adapted for SillyTavern

Assets 2

Releases: daswer123/xtts-api-server

0.9.0

0.9.0

0.8.6

0.8.6

0.8.5

0.8.4

0.8.3

0.8.2

0.8.1

0.8.0

Contributors

0.7.6

0.7.6

0.7.5

0.7.4

0.7.3

0.7.2

0.7.1

0.7.0

Contributors

0.6.8

0.6.8

0.6.7

0.6.6

0.6.5

0.6.4

0.6.3

0.6.1

0.6.0

Contributors

0.5.9

0.5.9

0.5.8

0.5.7

Contributors

0.5.6

0.5.0

0.5.1

0.5.2

0.5.3

0.5.4

0.5.5

0.5.6

Contributors

0.4.5

0.4.5

0.4.4

0.4.3

0.4.2

0.4.1

0.4.0

Contributors

0.3.2

0.2.5

0.2