Giter Club home page Giter Club logo

conv-tasnet's Introduction

ConvTasNet

A PyTorch implementation of the TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation

Requirements

see requirements.txt

Usage

./nnet/separate.py /path/to/checkpoint --input /path/to/mix.scp --gpu 0 > separate.log 2>&1 &
  • evaluate
./nnet/compute_si_snr.py /path/to/ref_spk1.scp,/path/to/ref_spk2.scp /path/to/inf_spk1.scp,/path/to/inf_spk2.scp

Result (on best configuratures in the paper)

ID Settings Causal Norm Param Loss Si-SDR
0 adam/lr:1e-3/wd:1e-5/32-batch/2gpu N BN/relu 8.75M -17.59/-15.45 14.63
1 adam/lr:1e-2/wd:1e-5/20-batch/2gpu N gLN/relu - -16.09/-15.21 14.58
2 adam/lr:1e-3/wd:1e-5/20-batch/2gpu N gLN/relu - -17.91/-16.54 15.87
3 adam/lr:1e-2/wd:1e-5/32-batch/2gpu N BN/sigmoid - -14.51/-13.40 12.62
4 adam/lr:1e-2/wd:1e-5/32-batch/2gpu N BN/relu - -17.20/-15.38 14.58
5 adam/lr:1e-3/wd:1e-5/20-batch/2gpu N gLN/sigmoid - -17.20/-16.11 15.55
6 adam/lr:1e-3/wd:1e-5/32-batch/2gpu Y BN/relu - -15.25/-12.47 11.42
7 adam/lr:1e-3/wd:1e-5/24-batch/2gpu N cLN/relu - -18.72/-16.17 15.25

Reference

Luo Y, Mesgarani N. TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation[J]. arXiv preprint arXiv:1809.07454, 2018.

conv-tasnet's People

Contributors

funcwj avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.