Giter Club home page Giter Club logo

谷下雨's Projects

asrt_speechrecognition icon asrt_speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

chatgpt-on-wechat icon chatgpt-on-wechat

Wechat robot based on ChatGPT, which using OpenAI api and itchat library. 使用ChatGPT搭建微信聊天机器人,基于GPT3.5/4.0 API实现,支持个人微信、公众号、企业微信部署,能处理文本、语音和图片,访问操作系统和互联网。

diffae icon diffae

Official implementation of Diffusion Autoencoders

dvector icon dvector

Speaker embedding (d-vector) trained with GE2E loss

ekho icon ekho

Chinese text-to-speech engine

fastspeech2 icon fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

fragmentvc icon fragmentvc

Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention

hifi-gan icon hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

hifigan icon hifigan

An 16kHz implementation of HiFi-GAN for soft-vc.

loinvc icon loinvc

Robust Feature Decoupling in Voice Conversion by using Locality-Based Instance Normalization

mae-vc icon mae-vc

Voice Conversion Based on Learnable Similarity-Guided Masked Autoencoder

mediumvc icon mediumvc

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

singlevc icon singlevc

Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.

so-vits-svc-fork icon so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

srd-vc icon srd-vc

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

tensorflowtts icon tensorflowtts

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

ttslearn icon ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

wav2lip icon wav2lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.