ComfyUI-XTTS

A custom ComfyUI node for the XTTS module of coqui-ai/TTS. It supports voice cloning and text-to-speech in 17 languages:

English (en), Spanish (es), French (fr), German (de), Italian (it), Portuguese (pt), Polish (pl), Turkish (tr), Russian (ru), Dutch (nl), Czech (cs), Arabic (ar), Chinese (zh-cn), Japanese (ja), Hungarian (hu), Korean (ko), Hindi (hi)

webpage

Disclaimer

We do not hold any responsibility for any illegal usage of the codebase. Please refer to your local laws about DMCA and other related laws.

Features

  • SRT subtitle files are supported
  • multiple speakers are supported in fine-tuning and inference via SRT files
  • the large ecosystem of ComfyUI custom nodes can be combined with XTTS

How to use

Make sure ffmpeg works from your command line. On Linux:

apt update
apt install ffmpeg

On Windows, you can install ffmpeg automatically with WingetUI.
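
To confirm that ffmpeg is actually reachable before installing the node, a small check like the following may help. It is only a sketch using the Python standard library; the function names are hypothetical and not part of ComfyUI-XTTS.

```python
import shutil
import subprocess
from typing import Optional


def ffmpeg_available() -> bool:
    """Return True if an `ffmpeg` executable can be found on PATH."""
    return shutil.which("ffmpeg") is not None


def ffmpeg_version() -> Optional[str]:
    """Return the first line printed by `ffmpeg -version`, or None if unavailable."""
    path = shutil.which("ffmpeg")
    if path is None:
        return None
    result = subprocess.run([path, "-version"], capture_output=True, text=True, check=False)
    lines = result.stdout.splitlines()
    return lines[0] if lines else None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found:", ffmpeg_version())
    else:
        print("ffmpeg not found on PATH; install it before using the node")
```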

Then clone the repository and install its dependencies:

git clone https://github.com/AIFSH/ComfyUI-XTTS.git
cd ComfyUI-XTTS
pip install -r requirements.txt

Weights are downloaded from Hugging Face automatically. If you are in China, make sure your network can reach Hugging Face; if you still have trouble, try following hf-mirror to configure your environment.

Alternatively, download the weight files, extract them, and place the entire pretrained_models folder in the ComfyUI-XTTS directory.
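
The two fallbacks above can be sketched as follows. This assumes the endpoint `https://hf-mirror.com` documented by hf-mirror and the `HF_ENDPOINT` environment variable read by the `huggingface_hub` library; whether this node downloads through that library is also an assumption. The helper names are hypothetical and not part of ComfyUI-XTTS.

```python
import os
from pathlib import Path

# Assumption: mirror endpoint as documented by the hf-mirror project.
HF_MIRROR_ENDPOINT = "https://hf-mirror.com"


def use_hf_mirror(endpoint: str = HF_MIRROR_ENDPOINT) -> str:
    """Point Hugging Face downloads at a mirror unless HF_ENDPOINT is already set.

    This must run before `huggingface_hub` is imported, because the library
    reads the variable at import time.
    """
    os.environ.setdefault("HF_ENDPOINT", endpoint)
    return os.environ["HF_ENDPOINT"]


def has_local_weights(node_dir: str) -> bool:
    """Return True if a `pretrained_models` folder exists inside the node directory."""
    return (Path(node_dir) / "pretrained_models").is_dir()


if __name__ == "__main__":
    print("Using endpoint:", use_hf_mirror())
    print("Local weights present:", has_local_weights("ComfyUI-XTTS"))
```

The same effect can be had by exporting `HF_ENDPOINT` in the shell before starting ComfyUI.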

Tutorial

Demo

Params

  • temperature: The softmax temperature of the autoregressive model. Defaults to 0.65.

  • length_penalty: A length penalty applied to the autoregressive decoder. Higher settings cause the model to produce more terse outputs. Defaults to 1.0.

  • repetition_penalty: A penalty that prevents the autoregressive decoder from repeating itself during decoding. Can be used to reduce the incidence of long silences or “uhhhhhhs”, etc. Defaults to 2.0.

  • top_k: Lower values mean the decoder produces more “likely” (aka boring) outputs. Defaults to 50.

  • top_p: Lower values mean the decoder produces more “likely” (aka boring) outputs. Defaults to 0.8.

  • speed: The speed rate of the generated audio. Defaults to 1.0. Values far from 1.0 can produce artifacts.
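
To build intuition for how temperature, top_k and top_p interact, here is a simplified, self-contained sketch of the filtering step on a toy distribution. It is not the code XTTS runs (real decoders differ in details, and repetition_penalty and length_penalty are not modelled); the function name is hypothetical, and the defaults are taken from the list above.

```python
import math
from typing import Dict, List


def filter_distribution(
    logits: List[float],
    temperature: float = 0.65,
    top_k: int = 50,
    top_p: float = 0.8,
) -> Dict[int, float]:
    """Return {token_index: probability} after temperature, top-k and top-p filtering."""
    # Temperature: dividing logits by a value below 1 sharpens the distribution.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]  # subtract max for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]

    # Top-k: keep only the k most probable tokens, then renormalize.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    k_mass = sum(probs[i] for i in ranked)

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept = []
    cumulative = 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i] / k_mass
        if cumulative >= top_p:
            break

    kept_mass = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_mass for i in kept}


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.0, -1.0]
    print(filter_distribution(toy_logits))
    print(filter_distribution(toy_logits, temperature=1.0, top_k=4, top_p=1.0))
```

Lower values of any of the three settings concentrate probability on fewer tokens, which is why the output becomes more "likely" and less varied.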

WeChat Group && Donate

Wechat donate

Thanks

coqui-ai/TTS
