Georgi Dzhambazov's Projects
A few web apps made for testing.
Lyrics-to-audio alignment system based on machine learning: hidden Markov models with Viterbi forced alignment. The alignment is explicitly aware of the durations of musical notes. The phonetic models are classified with an MLP deep neural network.
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable for any token-based alignment (a token may be a word, phrase, note, section, etc.). Used for the evaluation of the MIREX lyrics-to-audio alignment challenge.
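As a minimal sketch (not the repo's actual code), two metrics commonly used for token-based alignment evaluation can be computed from reference and detected token onset times; the function names and the 0.3 s tolerance below are illustrative assumptions:

```python
# Toy sketch of two common token-based alignment metrics (illustrative,
# not the repository's implementation).
def alignment_accuracy(ref_onsets, det_onsets, tolerance=0.3):
    """Fraction of tokens whose detected onset falls within
    `tolerance` seconds of the reference onset."""
    hits = sum(1 for r, d in zip(ref_onsets, det_onsets)
               if abs(r - d) <= tolerance)
    return hits / len(ref_onsets)

def mean_absolute_error(ref_onsets, det_onsets):
    """Mean absolute deviation (seconds) between reference and
    detected token onsets."""
    return sum(abs(r - d) for r, d in zip(ref_onsets, det_onsets)) / len(ref_onsets)

ref = [0.0, 1.2, 2.5, 4.0]   # reference word onsets (seconds)
det = [0.1, 1.1, 3.0, 4.05]  # detected word onsets
acc = alignment_accuracy(ref, det)   # 3 of 4 onsets within 0.3 s -> 0.75
mae = mean_absolute_error(ref, det)
```

Because the metrics only compare onset times, the same code works whether a token is a word, a phrase, or a section.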
A heuristic approach to the detection of choruses in vocal cover versions, done at the WiMIR workshop at ISMIR 2019: https://docs.google.com/presentation/d/1WYXxChgo8DI_NknyndPdcOuxlhAL_428Olg2fWaVy6U/edit#slide=id.g6f4f88d309_0_57
My Docker scripts and Dockerfiles for several frameworks.
MATLAB prototype of the drum transcription system described in the paper https://drive.google.com/file/d/0B4bIMgQlCAuqdGVRbVNNbzJfeUU/view
The Dunya music browser, developed using Django 2.
Scripts to map English phoneme models (a feed-forward multilayer perceptron network) onto a GMM-based Turkish phoneme model.
The dataset used in the paper https://drive.google.com/file/d/0B4bIMgQlCAuqdGVRbVNNbzJfeUU/view
C++ library of algorithms to extract features from audio files, including Python bindings.
Python hidden Markov model framework, adapted for computationally efficient Viterbi forced alignment, with an added explicit-duration model.
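The core idea of Viterbi forced alignment can be sketched in a few lines: the states form a fixed left-to-right chain (e.g. the phoneme sequence of the lyrics), and each frame either stays in the current state or advances to the next one. The function and argument names below are illustrative assumptions, not the framework's API:

```python
import math

def viterbi_forced_alignment(log_obs, log_stay, log_advance):
    """Toy forced-alignment Viterbi over a left-to-right state chain.
    log_obs[t][s] is the log-likelihood of frame t under state s."""
    T, S = len(log_obs), len(log_obs[0])
    NEG_INF = float("-inf")
    delta = [[NEG_INF] * S for _ in range(T)]   # best log-score per (frame, state)
    back = [[0] * S for _ in range(T)]          # backpointers
    delta[0][0] = log_obs[0][0]                 # must start in the first state
    for t in range(1, T):
        for s in range(S):
            stay = delta[t - 1][s] + log_stay
            adv = delta[t - 1][s - 1] + log_advance if s > 0 else NEG_INF
            if adv > stay:
                delta[t][s], back[t][s] = adv + log_obs[t][s], s - 1
            else:
                delta[t][s], back[t][s] = stay + log_obs[t][s], s
    # Backtrack from the last state (alignment must end in state S-1).
    path = [S - 1]
    for t in range(T - 1, 0, -1):
        path.append(back[t][path[-1]])
    return path[::-1]
```

An explicit-duration model replaces the single self-transition probability `log_stay` with a per-state duration distribution, which is what makes the alignment note-duration-aware.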
Parses models created by the HTK Toolkit (http://htk.eng.cam.ac.uk/) from text files into a Python class, which then enables various operations on the models, such as visualization and comparison.
Example of the inverse-MFCC feature in Essentia.
Smartphone game app that teaches a technique for memorising music intervals. Check out the demo video:
Lyrics-to-Audio Alignment for Jingju Arias
Singing-voice recordings with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
Lyrics-to-audio alignment system, initially built with HTK for rapid prototyping.
Experimental MATLAB version of lyrics-to-audio alignment using dynamic time warping (DTW).
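For illustration only (the repo itself is MATLAB), the DTW recurrence at the heart of such an aligner can be sketched over two 1-D feature sequences; the function name and absolute-difference cost are assumptions:

```python
def dtw_cost(a, b):
    """Toy DTW: minimal cumulative alignment cost between two
    1-D sequences, using absolute difference as the local cost."""
    n, m = len(a), len(b)
    INF = float("inf")
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i][j] = cost + min(D[i - 1][j],      # a-frame repeated
                                 D[i][j - 1],      # b-frame repeated
                                 D[i - 1][j - 1])  # one-to-one step
    return D[n][m]

# Warping absorbs the repeated frame, so the cost is zero:
dtw_cost([1, 2, 3], [1, 2, 2, 3])  # -> 0.0
```

In a real lyrics-to-audio setting the 1-D values would be replaced by per-frame feature vectors (e.g. MFCCs) and a vector distance.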
Python audio and music signal processing library. This fork adds support for synchronous tracking of vocal note onsets and metrical position in the bar, using a dynamic Bayesian network.
A cappella recordings of makam music.
Reproduces HTK-style MFCC features using the Essentia framework. The MFCCs extracted with Essentia are compared to those extracted with HTK and with librosa.
Music Structure Analysis Framework
Examples of extracting acoustic features with Essentia.
Tools to match MB to other datasets/collections (like Echo Nest).
Manual annotations of audio segments that correspond to score sections where singing voice is present.
PDNN: A Python Toolkit for Deep Learning. http://www.cs.cmu.edu/~ymiao/pdnntk.html
The data needed to generate my PhD thesis.