Add simple-tts example #12261

danemadsen · 2025-03-08T01:02:06Z

This differs from the existing tts example by not relying on common.h and instead only using llama.h.

danemadsen · 2025-03-08T02:45:19Z

I have no idea why that last one is failing?

ggerganov

I suppose if we merge this version, we can remove the existing common-based tts example.

ggerganov · 2025-03-08T08:02:22Z

examples/simple-tts/simple-tts.cpp

+
+    std::vector<llama_seq_id> seq_ids(n_parallel, 0);
+    for (int32_t i = 0; i < n_parallel; ++i) {
+        seq_ids[i] = i;
+    }
+


Note the parallel functionality is incomplete. We can either remove it to make the example simpler, or we can extend the example to support it. The latter is relatively easy to do - just store multiple sets of codes - one for each parallel sequence. And after that, generate multiple audio files - one for each set of codes.

But this can be done later to keep this PR simple - just making a note.
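A minimal sketch of the "one set of codes per parallel sequence" idea described above, not the PR's actual code. The names `token_t`, `parallel_codes`, and `output_path`, and the `-<seq>.wav` naming scheme, are hypothetical and chosen only for illustration. `token_t` stands in for `llama_token` so the snippet compiles without llama.h.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Stand-in for llama_token (assumption: used here only so the sketch
// builds without llama.h).
using token_t = int32_t;

// Holds one vector of generated codes for each parallel sequence.
struct parallel_codes {
    std::vector<std::vector<token_t>> per_seq;

    explicit parallel_codes(int32_t n_parallel) : per_seq(n_parallel) {}

    // Record the token sampled for sequence `seq_id` at the current step.
    // at() throws std::out_of_range on an invalid sequence id.
    void push(int32_t seq_id, token_t tok) {
        per_seq.at(seq_id).push_back(tok);
    }
};

// Hypothetical naming scheme: one audio file per sequence,
// e.g. "output-0.wav", "output-1.wav".
static std::string output_path(const std::string & stem, int32_t seq_id) {
    return stem + "-" + std::to_string(seq_id) + ".wav";
}
```

In the decode loop, each sampled token would be pushed with the index of the sequence it belongs to. After generation, each `per_seq[i]` would be converted to audio separately and written to `output_path(stem, i)`.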

danemadsen · 2025-03-08

Yes. Also, n_predict and n_parallel are constant values, so they could probably be removed, but I kept them in to align with the original example.

danemadsen · 2025-03-11

I tried to get rid of the parallel processing as you suggested, but that breaks the example and I can't get it to work without it. I added a note at the top of main for future devs.

Didn't want to waste too much time on it anyway as @ngxson suggests the example might not be needed at all.

ngxson · 2025-03-08T13:44:50Z

Tbh I don't think this is needed. TTS support is very experimental at the moment, and having two TTS examples only adds complexity when it comes to supporting more TTS models in the future. (Also note that the whole TTS logic is currently built around OuteTTS; many other TTS models use different mechanisms.)

We should focus on faster R&D for now by relying on common, then later decide whether to move TTS into a fully functional example (or even a shared library).

danemadsen · 2025-03-11T04:43:06Z

Though the experimental nature of TTS is a valid reason to keep using common.h, I feel that this example benefits downstream developers who are trying to build APIs for this feature. I'm currently trying to implement this in Dart, and it's very hard to do so when the only example supplied uses C++ code.

danemadsen added 13 commits March 7, 2025 22:06

start on simple-tts

0a001f1

functions

6ed172c

prompt

6429dbe

model and context load

2b206e2

sampling and speaker parsing

4f40f01

prompt functions

0b864c4

first sync

6b25ae9

simple-tts

d39b6f9

readme

9c72833

redefinitions

8690eab

greedy

31127b8

fix

e3de627

remove temp .gitignore

b97e333

github-actions bot added the examples label Mar 8, 2025

danemadsen added 8 commits March 8, 2025 11:05

unused functions

5a3155b

CI fixes

d087b13

Merge branch 'ggml-org:master' into master

0ce5d32

deprication fix?

67ee4d9

make batch_add static

75b0b00

header

26b9751

whitespace

08a1df1

_USE_MATH_DEFINES

40c021f

ggerganov reviewed Mar 8, 2025

add note

fb28cd6

danemadsen requested a review from ggerganov March 11, 2025 04:22

danemadsen added 2 commits March 11, 2025 14:22

whitespace

fac1756

Merge branch 'ggml-org:master' into master

8fb16a5
