Giter Club home page Giter Club logo

fch-tts's Issues

日文转语音

你好,请问模型有日文的吗?我看现在项目上的模型是ljspeech的,用这个来转日文是会报错的,报错如下:
RuntimeError: Calculated padded input size per channel: (5). Kernel size: (7). Kernel size can't be greater than actual input size
假如要自己训练日文模型,下载 JPSpeech-1.1.tar.bz2时失败,链接无法打开,有其他方式可以获取吗?目前只能下载 jsut_ver1.1 数据集,里面只有wav音频,下载对应的basic5000的lab标注信息,并没有metedata.csv文件

藏语文本的处理

您好!很兴奋能够看到有一个支持藏语语音合成的tts系统,我很好奇您是如何处理藏语文本的。具体来说就是您是使用了藏语字符作为输入,还是说是将藏语转成了音素进行输入?如果转成音素的话,您是基于规则做的,使用的是由了lexion还是您选择了别的方法转成了音素?最后关于藏语转成音素以及对应声调的问题,您这边处理了吗,或者说是您这边有什么好的方法进行藏语转音素以及对应的声调呢!

untimeError

untimeError: Calculated padded input size per channel: (5). Kernel size: (7). Kernel size can't be greater than actual input size

Synthesize - MelGan: Run out of Memory with CUDA

Env: (Nvidia T4,torch 1.9.0)
Tried the quick start with steps:
$ conda create -n ParallelTTS python=3.7.9
$ conda activate ParallelTTS
$ pip install -r requirements.txt
$ python synthesize.py
--checkpoint ./pretrained/ljspeech-parallel-epoch0100.pth
--melgan_checkpoint ./pretrained/ljspeech-melgan-epoch3200.pth
--input_texts ./samples/english/synthesize.txt
Failed with:
image

train-parallel.py 训练中有错误

Traceback (most recent call last):
File "/home/gaol/codes/Voices/FCH-TTS/train-parallel.py", line 69, in
loggers=loggers
File "/home/gaol/codes/Voices/FCH-TTS/helpers/trainer.py", line 319, in fit
valid_losses = self._validate(valid_loader)
File "/home/gaol/codes/Voices/FCH-TTS/helpers/trainer.py", line 419, in _validate
loss.item(), l1_loss.item(), ssim_loss.item(), drn_loss.item()
AttributeError: 'float' object has no attribute 'item'

MelGan 模型

请问有普通话的bbspeech-melgan-epoch*.pth 预训练模型吗 或者是要跑哪个脚本自己训练

Data Error!!!

dear atomicoo:
There is error url : https://open-speech-data.oss-cn-hangzhou.aliyuncs.com, can not download data when run prepare_dataset.py

and could U pls share the the structure of directory “datasets” ,it's differece between your script
dataset_path = osp.join(datasets_path, dataset_dir)
wavfile_path = osp.join(dataset_path, "wavs")
melspec_path = osp.join(dataset_path, "mels")
and office data of BiaoBei
PhoneLabeling
ProsodyLabeling
Wave

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.