A Midi(pianoroll) to Performace model based on DDPM

TO DO

Piano Solo(Mastero Dataset) model train
The script for processing the Mastero and Musicnet dataset
Inpainting and Generation Piano Solo based on MIDI
M2P models trained on some multi-instrument data from the Musicnet dataset(String Quartet et.al)
Inpainting and Generation Multi-Instrument based on MIDI
Unconditional Generation model

Requirements

install torch == 2.2.0 (Versions below this will not be able to use Vocoder, and versions above this do not have adapted Lighting) , torchaudio
run

pip install -r requirements.txt

Inference for Midi to Performace

Download pretained M2P model in releases
Download pretained Vocoder in https://github.com/fishaudio/vocoder/releases/download/1.0.0/firefly-gan-base-generator.ckpt
Adjust train.yaml, Notice use Absolute path

evaluation:
    chkpt_path: "/disk2/Opensource-DDPM-M2P/M2P_model/checkpoints/ddpmv2-2048-512-2048-2res-epoch=227-loss=0.0271.ckpt"  #M2P model path
    device: "cuda"
    vocoder_path: "/disk2/Opensource-DDPM-M2P/M2P_model/firefly-gan-base-generator.ckpt"      #Vocoder path
    test_midi_path: "/disk2/DDPM-M2P/gen_example"      #midi path
    data_start: 0
    max_frame: 2048  #Once inference length
    dataset_type: mastero

python main/generation/pianoroll_generation.py

Train Piano Solo M2P model

download mastero dataset and unzip

mastero
├───...
├───2014
│   ├───...wav
│   ├───...midi
│   └───...
├───2015
├───...

Adjust train.yaml

mastero:
  data_path: /disk2/Piano-Solo/mastero    
  save_path: /disk2/Piano-Solo/Processed_data

python main/prepare_data/prepare_mastero_data.py

run (optional setting, just for checking whether the dataset is aligned.)

python main/prepare_data/check_dataset.py

train model (the model can produce comprehensible results after training for 80 epochs)

python main/train_ddpm.py

Something TO DO ...

Reference

Vocoder: https://github.com/fishaudio/vocoder

Musicnet dataset script: https://github.com/bwang514/PerformanceNet

Cite

@INPROCEEDINGS{10095769,
  author={Liu, Kaiyang and Gan, Wendong and Yuan, Chenchen},
  booktitle={ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, 
  title={MAID: A Conditional Diffusion Model for Long Music Audio Inpainting}, 
  year={2023},
  pages={1-5},
  doi={10.1109/ICASSP49357.2023.10095769}}
'''

tps-f / ddpm-midi2performance-model Goto Github PK

ddpm-midi2performance-model's Introduction

A Midi(pianoroll) to Performace model based on DDPM

TO DO

Requirements

Inference for Midi to Performace

Train Piano Solo M2P model

Something TO DO ...

Reference

Cite

ddpm-midi2performance-model's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent