
zhenhaoge's Projects

auxiliaryasr

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

espnet

End-to-End Speech Processing Toolkit

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

freevc

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

ft-w2v2-ser

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

nemo

NeMo: a framework for generative AI

parler-tts

Inference and training library for high-quality TTS models.

pyctcdecode

A fast and lightweight Python-based CTC beam search decoder for speech recognition.

pyvideotrans

Translate a video from one language to another and add dubbing.

salmonn

SALMONN: Speech Audio Language Music Open Neural Network

styletts2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

transformer-tts

A PyTorch implementation of "Neural Speech Synthesis with Transformer Network"

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer-based neural network for text-to-speech.

vocal-separate

An extremely simple tool for separating vocals from background music, operated entirely through a local web interface with no internet connection required, using 2stems/4stems/5stems models.

voice-assistant

A simple toy demo of a local voice assistant built with Whisper and a large language model.

voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

yt-dlp

A feature-rich command-line audio/video downloader
