Giter Club home page Giter Club logo

wetts's Introduction

WeTTS

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Install

We suggest installing WeTTS with Anaconda or Miniconda. Clone this repo:

git clone https://github.com/wenet-e2e/wetts.git

For CUDA 10.2, run:

conda create -n wetts python=3.8 montreal-forced-aligner pytorch=1.11 \
torchaudio cudatoolkit=10.2 -c pytorch -c conda-forge

For CUDA 11.3, run:

conda create -n wetts python=3.8 montreal-forced-aligner pytorch=1.11 \
torchaudio cudatoolkit=11.3 -c pytorch -c conda-forge

Installing other dependencies using:

conda activate wetts
python -m pip install -r requirements.txt

Roadmap

We mainly focus on production and on-device TTS, and we plan to use:

  • AM: FastSpeech2
  • vocoder: hifigan/melgan

And we are going to provide reference solution of:

  • Prosody
  • Polyphones
  • Text Normalization

Dataset

We plan to support a variaty of open source TTS datasets, include but not limited to:

  • BZNSYP, Chinese Standard Mandarin Speech corpus open sourced by Data Baker.
  • AISHELL-3, a large-scale and high-fidelity multi-speaker Mandarin speech corpus.
  • Opencpop, Mandarin singing voice synthesis (SVS) corpus open sourced by Netease Fuxi.

Runtime

We plan to support a variaty of hardwares and platforms, including:

  • x86
  • Android
  • Raspberry Pi
  • Other on-device platforms

Acknowledgement

  1. We borrow some code from FastSpeech2 for FastSpeech2 implentation.
  2. We refer PaddleSpeech for feature extraction, pinyin lexicon preparation for alignment, and the length regulator in FastSpeech2.

wetts's People

Contributors

robin1001 avatar unrea1-sama avatar zpcoftts avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.