Name	Name	Last commit message	Last commit date
Latest commit zjzser Update README.md Dec 22, 2024 22c0f7f · Dec 22, 2024 History 14 Commits
modules	modules	Delete modules/__pycache__ directory	Oct 21, 2024
quantization	quantization	Add files via upload	Oct 21, 2024
README.md	README.md	Update README.md	Dec 22, 2024
config.json	config.json	Add files via upload	Oct 21, 2024
env.py	env.py	Add files via upload	Oct 21, 2024
inference.py	inference.py	Add files via upload	Oct 21, 2024
meldataset.py	meldataset.py	Add files via upload	Oct 21, 2024
models.py	models.py	Add files via upload	Oct 21, 2024
msstftd.py	msstftd.py	Add files via upload	Oct 21, 2024
pooling_layers.py	pooling_layers.py	Add files via upload	Oct 21, 2024
requirements.txt	requirements.txt	Add files via upload	Dec 22, 2024
resnet.py	resnet.py	Add files via upload	Oct 21, 2024
train.py	train.py	Update train.py	Nov 16, 2024
utils.py	utils.py	Add files via upload	Oct 21, 2024
watermark.py	watermark.py	Add files via upload	Oct 21, 2024

Repository files navigation

TraceableSpeech

PyTorch Implementation of TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Now we open the part of speech watermarking.

This is the watermark training pipeline.

Quick Started

Dependencies

pip install -r requirement.txt

Default Preparation

We are using the LibriTTS dataset.

Modify the parameter --input_training_file --input_validation_file --checkpoint_path in train.py

Modify the parameter --input_wavs_dir --output_dir --checkpoint_file in inference.py

Modify the config.json

Watermark config

The watermark configuration is in the watermark.py file, defaulting to 4-digit base-16.

Train

python train.py

Test

python inference.py

Acknowledgements

This implementation uses parts of the code from the following Github repos: AcademiCodec

Citations

If you find this code useful in your research, please consider citing:

@misc{zhou2024traceablespeechproactivelytraceabletexttospeech,
      title={TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking}, 
      author={Junzuo Zhou and Jiangyan Yi and Tao Wang and Jianhua Tao and Ye Bai and Chu Yuan Zhang and Yong Ren and Zhengqi Wen},
      year={2024},
      eprint={2406.04840},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2406.04840}, 
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TraceableSpeech

Quick Started

Dependencies

Default Preparation

Watermark config

Train

Test

Acknowledgements

Citations

About

Releases

Packages

Languages

zjzser/TraceableSpeech

Folders and files

Latest commit

History

Repository files navigation

TraceableSpeech

Quick Started

Dependencies

Default Preparation

Watermark config

Train

Test

Acknowledgements

Citations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages