mmmmichaelzhang's Projects

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

audiodvp

AudioDVP: Photorealistic Audio-driven Video Portraits

audiomass

Free full-featured web-based audio & waveform editing tool

auxiliaryasr

Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

bark

🔊 Text-Prompted Generative Audio Model

cmgan

Conformer-based Metric GAN for speech enhancement

cross-lingual-voice-cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

facial

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

fullsubnet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

gpt-sovits

One minute of voice data is enough to train a good TTS model (few-shot voice cloning).

matchering

🎚️ Open Source Audio Matching and Mastering

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

mir-svc

Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach

mockingbird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

neuralsvb

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

pitchextractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

randomcnn-voice-transfer

Audio style transfer with shallow random parameters CNN. Result: https://soundcloud.com/mazzzystar/sets/speech-conversion-sample

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

ssr_eval

Evaluation and Benchmarking of Speech Super-resolution Methods
