Persian/Farsi text-to-speech (TTS) training using Coqui TTS
This repository contains sample code for training text-to-speech models.
Feel free to ask questions in the issues.
Sample code and notebooks are available in the recepies folder.
These are models you can use for testing or fine-tuning:
- Share your trained models here
- 🤗 Hugging Face demo: https://huggingface.co/spaces/Kamtera/Persian-tts-CoquiTTS
The models were trained on these datasets:
- https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset
- https://www.kaggle.com/datasets/magnoliasis/persian-tts-dataset-famale
- If you've created a dataset or found a good one on the web, you can share it with us here.
- Synthesize a single text from the command line
tts --text "شیش سیخ جیگر" --model_path "best_model.ckpt" --config_path "config.json"
- From the Python API
from TTS.utils.synthesizer import Synthesizer

model_path = "best_model.pth"  # Absolute path to the model checkpoint (.pth)
config_path = "config.json"  # Absolute path to the model config.json
# Roughly: "Life happens only once; make good use of it."
text = ".زندگی فقط یک بار است؛ از آن به خوبی استفاده کن"

# Load the model once: checkpoint path first, then config path.
synthesizer = Synthesizer(model_path, config_path)
wavs = synthesizer.tts(text)  # Synthesize the waveform for the text
synthesizer.save_wav(wavs, "sp.wav")  # Write the waveform to a WAV file
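
When testing a trained model, it is often handy to synthesize several sentences in one run rather than one at a time. The sketch below shows one possible way to do that, reusing only the `tts` and `save_wav` calls from the example above. The helper name `synthesize_many`, the `out_dir` parameter, and the `sentence_000.wav` naming pattern are assumptions made for illustration; they are not part of Coqui TTS or this repository.

```python
from pathlib import Path
from typing import Iterable, List


def synthesize_many(synthesizer, texts: Iterable[str], out_dir: str = "out") -> List[Path]:
    """Synthesize each non-empty text and save one WAV file per text.

    `synthesizer` is expected to be an already-loaded object that exposes
    `tts(text)` and `save_wav(wav, path)`, as in the example above.
    The function name and file naming scheme are hypothetical.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    paths: List[Path] = []
    for text in texts:
        text = text.strip()
        if not text:
            continue  # skip blank lines instead of synthesizing silence
        wav = synthesizer.tts(text)
        # Number files by how many have been written so far.
        path = out / f"sentence_{len(paths):03d}.wav"
        synthesizer.save_wav(wav, str(path))
        paths.append(path)
    return paths
```

With the `synthesizer` object created as shown earlier, a call such as `synthesize_many(synthesizer, ["سلام", "خداحافظ"], out_dir="samples")` would write one file per sentence into `samples`. Loading the model once and reusing it avoids paying the checkpoint loading cost for every sentence.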