crnn_multitask_adt's Introduction

Evaluating a multi-task and a single-task CRNN model for Automatic Drum Transcription

This notebook is part of my final project for the Music Information Retrieval course of the Master in Sound and Music Computing from Universitat Pompeu Fabra in Barcelona. The report accompanying the notebook can be found here.

Summary

Multi-task learning can improve the generalization and robustness of data-driven models, provided there is a statistical relationship between the tasks. Recent approaches to automatic drum transcription (ADT) have exploited the locally periodic nature of Western drum performances by training ADT systems jointly with beat and downbeat detection. Most ADT vocabularies contain only the three most common drum instruments (kick drum, snare drum and hi-hat). Training such models for larger vocabularies is difficult because the less common drum instruments appear too rarely in the available datasets. A recent approach by Cartwright and Bello trains a CRNN model on a synthetic dataset alongside existing datasets in order to balance the instrument occurrences. In this project, I compare the performance of the multi-task version of this CRNN model with that of its single-task version on the Groove MIDI Dataset.
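
As a rough illustration of what joint training means here, the sketch below combines a drum-onset loss with a beat-detection loss into one objective. It is not taken from the notebook or from Cartwright and Bello's implementation: the function names, the use of binary cross-entropy, and the `beat_weight` parameter are all assumptions made for the example. Setting `beat_weight` to 0 recovers a single-task objective.

```python
import math


def binary_cross_entropy(targets, probs, eps=1e-7):
    """Mean binary cross-entropy over a flat list of frame-level activations."""
    total = 0.0
    for t, p in zip(targets, probs):
        p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1 - p))
    return total / len(targets)


def multitask_loss(drum_targets, drum_probs, beat_targets, beat_probs,
                   beat_weight=0.5):
    """Hypothetical joint objective: drum loss plus a weighted beat loss.

    beat_weight is an assumed hyperparameter; 0 gives the single-task case.
    """
    drum_loss = binary_cross_entropy(drum_targets, drum_probs)
    beat_loss = binary_cross_entropy(beat_targets, beat_probs)
    return drum_loss + beat_weight * beat_loss


# Toy frame-level targets and predicted probabilities (made-up values).
drum_t, drum_p = [1, 0, 0, 1], [0.9, 0.2, 0.1, 0.7]
beat_t, beat_p = [1, 0, 0, 0], [0.8, 0.3, 0.2, 0.1]

single = multitask_loss(drum_t, drum_p, beat_t, beat_p, beat_weight=0.0)
joint = multitask_loss(drum_t, drum_p, beat_t, beat_p, beat_weight=0.5)
print(f"single-task loss: {single:.4f}, multi-task loss: {joint:.4f}")
```

In a real model both heads would share the convolutional and recurrent layers, so gradients from the beat term also shape the features used for drum transcription.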

Setup environment

The Jupyter notebook uses a conda environment, which can be set up by running the following commands in the terminal:

CONDA_RESTORE_FREE_CHANNEL=1 conda env create -f environment.yml

and then:

conda activate rhythm

Contributors

pelinski
