A Vocoder Based Method For Singing Voice Extraction

Pritish Chandna, Merlijn Blaauw, Jordi Bonada, Emilia Gómez

Music Technology Group, Universitat Pompeu Fabra, Barcelona

This repository contains the source code for the paper with the same title. Please note that the model presented here is currently configured just for the iKala dataset, as published in the corresponding paper. We will release a more generalized model soon.

Installation

To install, clone the repository and run
pip install -r requirements.txt
to install the required packages.

The main code is in the train_tf.py file. To use it, download the model weights and place them in the log_dir_m1 directory defined in config.py. Wave files to be tested should be placed in the wav_dir directory, also defined in config.py. TensorFlow must also be installed on the machine.
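
Before running anything, it can help to confirm that the directories named in config.py actually exist. The following is a minimal sketch and not part of the repository: the helper missing_dirs is hypothetical, and it assumes that config.py exposes log_dir_m1, wav_dir and val_dir as plain string paths.

```python
import os

# Assumption: these are the directory attributes defined in config.py.
DIR_ATTRS = ("log_dir_m1", "wav_dir", "val_dir")


def missing_dirs(cfg, attrs=DIR_ATTRS):
    """Return (attribute, path) pairs whose directory is undefined or absent.

    `cfg` is any object carrying the attributes, e.g. the imported config module.
    """
    missing = []
    for attr in attrs:
        path = getattr(cfg, attr, None)
        if path is None or not os.path.isdir(path):
            missing.append((attr, path))
    return missing
```

From the repository root, this could be used as `import config` followed by `print(missing_dirs(config))`; an empty list would mean all three directories were found.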

Data pre-processing

Once the iKala files have been put in the wav_dir, you can run

python prep_data_ikala.py
to carry out the data pre-processing step.

Training and inference

Once set up, you can run the command

python train_tf.py -t
to train, or
python train_tf.py -s <filename> -p
to synthesize the output; the -p flag is optional and produces plots. The output will be saved in the val_dir specified in the config.py file. Note that plots are only supported for iKala songs, as the ground truth is available for these songs.
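
To synthesize several files in one go, the command above can be wrapped in a small loop. This is a sketch under stated assumptions, not part of the repository: synth_command and synthesize_all are hypothetical helpers, and the sketch assumes that -s accepts a file name exactly as it appears in wav_dir, which the text above does not specify.

```python
import os
import subprocess
import sys


def synth_command(filename, plots=False, script="train_tf.py"):
    """Build the documented synthesis command for one file."""
    cmd = [sys.executable, script, "-s", filename]
    if plots:
        cmd.append("-p")  # optional flag: also produce plots (iKala songs only)
    return cmd


def synthesize_all(wav_dir, plots=False):
    """Run the synthesis command once per .wav file found in wav_dir."""
    for name in sorted(os.listdir(wav_dir)):
        if name.lower().endswith(".wav"):
            # Assumption: the script expects the bare file name, not a full path.
            subprocess.run(synth_command(name, plots), check=True)
```

Running each file in its own process keeps the script's command-line interface unchanged, at the cost of reloading the model for every file.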

Evaluation

Once the files have been synthesized, add the examples to be evaluated to the sep_eval folder. Then run

python sep_eval.py
to evaluate them. The results will be saved in CSV format in the file eval.csv.
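
The resulting file can be inspected with Python's standard csv module. The sketch below is illustrative only: load_eval is a hypothetical helper, and it assumes that the first row of eval.csv is a header row, since the column layout is not described here.

```python
import csv


def load_eval(path="eval.csv"):
    """Read the evaluation file and return (header, rows) as lists of strings."""
    with open(path, newline="") as f:
        rows = list(csv.reader(f))
    if not rows:
        return [], []
    # Assumption: the first row holds the column names.
    return rows[0], rows[1:]
```

All values are returned as strings; converting them to numbers is left to the caller, since the column types are not known in advance.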

We are currently working on future applications of the methodology. The remaining files in the repository serve that purpose and can be ignored. We will update the repository further in the coming months.

Acknowledgments

The TITAN X GPU used for this research was donated by the NVIDIA Corporation. This work is partially supported by the Towards Richer Online Music Public-domain Archives (TROMPA) project.
