
caochenbin's Projects

dpcrn_dns3

Implementation of the paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

eabnet

This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP 2022.

fastf0nls

C++ and MATLAB code for fast and accurate fundamental frequency estimation

fullsubnet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

gagnet

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.

gcc-nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

gpurir

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

hgcn

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

intelligibility-metricgan

Implementation of the paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"

libss

A Python library for blind source separation.

masp

Microphone Array Speech Processing

melgan-neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

metricgan

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)

metricgan-okd

Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICML 2023)

mha-dpcrn

We design a spectral compression mapping (SCM) for full-band speech enhancement and propose a two-stage stream named MHA-DPCRN.

mp-senet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

mtfaa-net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
