caochenbin Goto Github PK
Type: User
Type: User
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
Unofficial implementation of Dual-Path Transformer Network (DPTNet) for speech separation (Interspeech 2020)
This project gives an example of dual microphone speech enhancement based on GSC beamformer and multiple channel postfilter.
End-To-End Deep Learning-based Adaptation Control for Linear Acoustic Echo Cancellation
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which was submitted to ICASSP2022.
C++ and MATLAB code for fast and accurate fundamental frequency estimation
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by Elsevier Applied Acoustics.
Real-time GCC-NMF Blind Speech Separation and Enhancement
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
The official implementation of GTCRN, an ultra-lite speech enhancement model.
The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning"
A Python library for blind source separation.
Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)
Spectral Subtraction, Wiener Filtering, MMSE
Microphone Array Speech Processing
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)
Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICML 2023)
MetricGAN+ PyTorch Implementation
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.