shirly-24 Goto Github PK

followers: 1.0 following: 28.0 repos: 26.0 gists: 0.0

Type: User

shirly-24's Projects

audioldm

AudioLDM: Generate speech, sound effects, music and beyond, with text.

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

audiomae

This repo hosts the code and models of "Masked Autoencoders that Listen".

avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

awesome-diffusion-models

A collection of resources and papers on Diffusion Models

awesome-singing-voice-synthesis-and-singing-voice-conversion

A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).

awesome-voice-conversion

A curated list of awesome voice conversion, projects and communities.

bigvgan

Official PyTorch implementation of BigVGAN (ICLR 2023)

conference-accepted-paper-list

Some Conferences' accepted paper lists (including AI, ML, Robotic)

cvpr2023-papers-with-code

CVPR 2023 论文和开源项目合集

ddsp_pytorch

Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch

deep-learning-for-image-processing

deep learning for image processing including classification and object-detection etc.

fa-gan

fakebob

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

istft-avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

ldl

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

neuralspeech

paper-reading

深度学习经典、新论文逐段精读

parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

s3prl

Audio Foundation Models (Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit)

unganable

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

shirly-24 Goto Github PK

shirly-24's Projects

Recommend Projects

Recommend Topics

Recommend Org