Giter Club home page Giter Club logo

ws-choi / ismir2020_u_nets_svs Goto Github PK

View Code? Open in Web Editor NEW
77.0 5.0 9.0 109.44 MB

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

License: MIT License

Jupyter Notebook 99.74% Python 0.26%
singing-voice-separation musdb18 pytorch source-separation ismir2020

ismir2020_u_nets_svs's People

Contributors

ws-choi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

ismir2020_u_nets_svs's Issues

Training time

Hello, Thank you for your sharing codes.

Could you please tell me how long does it take you to train this model (TFC_TDF_Net (large))?
In my setup, one 2080ti, batch_size=8, it takes 12 minutes for every epoch. It means that I need to wait 17 days for 2000 epochs. It looks like unreasonable. Is this the same as your training time?

Looking forward to you.

Volume issue and glitches

Hello, thanks for good example repository!

Is there any way to fix lags in model output? I mean two things:

  1. much part of realtive volume is changed. Some pieces of song sounds in increased volume, relative to all track, comparing to original. Some times it's sound very glitchy, so sounds that were very quet in original, now very loud and much louder than rest of track.
  2. Some strange sound in output, like glitches. Like some non voice sounds from original track, but in increased volume and transformed sound. Sometimes it sound scarry )))

Tranks in advance!

ImportError: cannot import name 'EvalResult' from 'pytorch_lightning'

Hello, I encountered this error when configuring the environment. How can I solve it?
Traceback (most recent call last):
File "main.py", line 7, in
from source_separation.models.scripts import trainer, evaluator
File "/home/Music/ISMIR2020_U_Nets_SVS-master/source_separation/models/scripts/trainer.py", line 8, in
from source_separation.models.model_definition import get_class_by_name
File "/home/Music/ISMIR2020_U_Nets_SVS-master/source_separation/models/model_definition.py", line 3, in
from source_separation.models.tdc_net import TDC_NET_Framework
File "/home/Music/ISMIR2020_U_Nets_SVS-master/source_separation/models/tdc_net.py", line 8, in
from source_separation.models.dense_u_net import Dense_UNET_Framework, Dense_UNET
File "/home/Music/ISMIR2020_U_Nets_SVS-master/source_separation/models/dense_u_net.py", line 9, in
from source_separation.models.separation_framework import Spectrogram_based
File "/home/Music/ISMIR2020_U_Nets_SVS-master/source_separation/models/separation_framework.py", line 12, in
from pytorch_lightning import EvalResult
ImportError: cannot import name 'EvalResult' from 'pytorch_lightning' (/opt/conda/lib/python3.7/site-packages/pytorch_lightning/init.py)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.